# Common Voice fine-tuning

Wav2vec2 Large Xlsr 53 Hungarian
Apache-2.0
An automatic speech recognition model fine-tuned on the Hungarian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers Other
W
sarpba
17
1
Whisper Uz
Apache-2.0
Uzbek automatic speech recognition model fine-tuned from OpenAI Whisper Medium
Speech Recognition Transformers Other
W
mustafoyev202
110
1
Whisper Uz
Apache-2.0
Uzbek speech recognition model fine-tuned on Whisper Base, trained on the Common Voice dataset
Speech Recognition Transformers Other
W
jamshidahmadov
1,179
3
Whisper Small Uzbek
Apache-2.0
Uzbek automatic speech recognition model fine-tuned from OpenAI Whisper-small on Common Voice 17.0 dataset
Speech Recognition Transformers Other
W
abduaziz
20
2
Whisper Large V3 Turbo Es
MIT
Spanish speech recognition model fine-tuned based on Whisper-large-v3-turbo, achieving a word error rate reduction to 5.34% on the Common Voice 17.0 Spanish dataset
Speech Recognition Transformers Spanish
W
adriszmar
52
4
Whisper Large V3 Az
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the Azerbaijani Common Voice 17.0 dataset based on OpenAI's Whisper Large v3, achieving a word error rate (WER) of 1.195%.
Speech Recognition Transformers Other
W
nsalahaddinov
96
1
Whisper Large V3 Turkish Test1
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 17.0 Turkish dataset based on OpenAI Whisper-large-v3
Speech Recognition Transformers Other
W
erdiyalcin
21
3
Whisper Tiny Ru
Apache-2.0
This model is a Russian automatic speech recognition model fine-tuned on the Common Voice 14.0 dataset based on openai/whisper-tiny.
Speech Recognition Transformers
W
whitemouse84
333
1
Training V2
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 11.0 Russian dataset based on OpenAI Whisper-base
Speech Recognition Transformers Other
T
SofiaK
15
1
Speecht5 Finetuned Commonvoice Ru Translit
MIT
A Russian text-to-speech model fine-tuned on the Common Voice 13 dataset based on microsoft/speecht5_tts
Speech Synthesis Transformers Other
S
voxxer
57
2
Speecht5 Tts Common Voice 5 Sv
MIT
A Swedish text-to-speech model fine-tuned based on Microsoft's SpeechT5 architecture, trained using the Common Voice dataset
Speech Synthesis Transformers Other
S
GreenCounsel
27
1
Whisper Medium Turkish 2
Apache-2.0
Turkish speech recognition model fine-tuned based on OpenAI Whisper Medium, trained on the Common Voice 11.0 dataset
Speech Recognition Transformers Other
W
emre
267
15
Whisper Large V2 Hungarian
Apache-2.0
A speech recognition model fine-tuned on the Hungarian Common Voice dataset based on OpenAI Whisper Large-V2
Speech Recognition Transformers Other
W
DrishtiSharma
21
1
Whisper Large V2 Hausa
Apache-2.0
This model is a fine-tuned version of OpenAI's Whisper Large-V2 for Hausa speech recognition tasks, trained on the Common Voice 11.0 dataset
Speech Recognition Transformers Other
W
DrishtiSharma
44
5
Whisper Large V2 Slovenian
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice 11.0 Slovenian dataset based on OpenAI's Whisper Large-V2 model, with a word error rate of 13.83%.
Speech Recognition Transformers Other
W
DrishtiSharma
53
1
Whisper Large V2 Bn
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned on Bengali speech datasets based on OpenAI Whisper Large-v2
Speech Recognition Transformers Other
W
anuragshas
319
6
Whisper Large V2 Ta
Apache-2.0
Tamil automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper Large-v2, achieving 8.45% word error rate on Common Voice 11.0 Tamil test set
Speech Recognition Transformers Other
W
anuragshas
15
1
Whisper Large V2 Malayalam
Apache-2.0
This is a fine-tuned version of the OpenAI Whisper Large V2 model for Malayalam speech recognition tasks, trained using the Common Voice 11.0 dataset
Speech Recognition Transformers Other
W
DrishtiSharma
23
4
Whisper Large Pt Cv11
Apache-2.0
A speech recognition model fine-tuned on the Portuguese Common Voice 11 dataset based on OpenAI's Whisper-large-v2 model
Speech Recognition Transformers Other
W
jonatasgrosman
155
13
Whisper Large V2 Punjabi
Apache-2.0
Punjabi automatic speech recognition model fine-tuned on OpenAI Whisper-large-v2, trained on Common Voice 11.0 dataset
Speech Recognition Transformers Other
W
DrishtiSharma
27
1
Whisper Large V2 Vietnamese
Apache-2.0
This model is an automatic speech recognition (ASR) model based on OpenAI's Whisper Small architecture, fine-tuned on the Common Voice 11.0 Vietnamese dataset
Speech Recognition Transformers Other
W
DrishtiSharma
25
2
Whisper Large V2 Cantonese
Apache-2.0
An automatic speech recognition model fine-tuned on Cantonese dataset based on OpenAI Whisper Large V2, achieving a character error rate of 6.7274% on the test set
Speech Recognition Transformers Other
W
simonl0909
131
12
Exp W2v2t Fr Vp Fr S438
Apache-2.0
A French automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-fr-voxpopuli model, trained using the Common Voice 7.0 French dataset.
Speech Recognition Transformers French
E
jonatasgrosman
20
0
Exp W2v2t Fr Unispeech S42
Apache-2.0
A speech recognition model fine-tuned using the Common Voice 7.0 (French) dataset, based on the microsoft/unispeech-large-1500h-cv model
Speech Recognition Transformers French
E
jonatasgrosman
20
0
Exp W2v2t It Vp Fr S821
Apache-2.0
An Italian automatic speech recognition model fine-tuned from facebook/wav2vec2-large-fr-voxpopuli, trained using the Common Voice 7.0 Italian dataset
Speech Recognition Transformers Other
E
jonatasgrosman
27
0
Exp W2v2t It Vp 100k S449
Apache-2.0
An Italian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-100k-voxpopuli model, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
17
0
Exp W2v2t Th Wav2vec2 S664
Apache-2.0
A Thai speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 dataset
Speech Recognition Transformers Other
E
jonatasgrosman
14
0
Exp W2v2t En No Pretraining S289
Apache-2.0
This is a model designed for English speech recognition tasks, based on a randomly initialized wav2vec2 architecture and fine-tuned using the Common Voice 7.0 dataset.
Speech Recognition Transformers English
E
jonatasgrosman
18
0
Victor Hg Ptbr 2.0
Apache-2.0
Portuguese speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
V
Vkt
30
0
Wav2vec2 Large Xlsr 53 Cantonese
Apache-2.0
A Cantonese fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53 using the Common Voice corpus version 8.0
Speech Recognition Transformers Other
W
CAiRE
1,214
3
Wav2vec2 Common Voice Tr Demo
Apache-2.0
This model is a speech recognition model fine-tuned on the Turkish Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers Other
W
YiTian
30
0
Wav2vec2 Xls R 300m Gn Cv8 4
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Common Voice 8.0 dataset based on the facebook/wav2vec2-xls-r-300m model, specifically optimized for the Guarani language (gn).
Speech Recognition Transformers Other
W
lgris
17
0
Output
This model is an automatic speech recognition model fine-tuned on the Abkhaz language dataset, based on the XLS-R architecture
Speech Recognition Transformers Other
O
deepdml
25
0
Wav2vec2 Large Xls R 300m Slovenian
Apache-2.0
An automatic speech recognition model fine-tuned on Slovenian speech datasets based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
infinitejoy
13
0
Wav2vec2 Xls R 300m Gn Cv8 3
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned on the Guarani (gn) Common Voice 8.0 dataset based on the facebook/wav2vec2-xls-r-300m model
Speech Recognition Transformers Other
W
lgris
17
0
Wavlm Large CORAA Pt Cv7
Apache-2.0
Portuguese automatic speech recognition model based on WavLM-large architecture, fine-tuned on the Common Voice 7.0 dataset
Speech Recognition Transformers Other
W
lgris
15
0
Wav2vec2 Large Xls R 300m Armenian
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Armenian speech dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
W
infinitejoy
1,618
0
Xls R Ta
Apache-2.0
Automatic speech recognition model fine-tuned on Tamil dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
X
jejomi
22
0
Wav2vec2 Large Xlsr Hindi Demo Colab
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset for Hindi speech recognition tasks.
Speech Recognition Transformers
W
nikhil6041
19
0
Wav2vec2 Large Xls R 300m Turkish Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice Turkish dataset based on facebook/wav2vec2-xls-r-300m.
Speech Recognition Transformers
W
nimrah
19
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase